Origin and Length Distribution of Unidirectional Prokaryotic Overlapping Genes
نویسندگان
چکیده
Prokaryotic unidirectional overlapping genes can be originated by disrupting and replacing of the start or stop codon of one protein-coding gene with another start or stop codon within the adjacent gene. However, the probability of disruption and replacement of a start or stop codon may differ significantly depending on the number and redundancy of the start and stop codons sets. Here, we performed a simulation study of the formation of unidirectional overlapping genes using a simple model of nucleotide change and contrasted it with empirical data. Our results suggest that overlaps originated by an elongation of the 3'-end of the upstream gene are significantly more frequent than those originated by an elongation of the 5'-end of the downstream gene. According to this, we propose a model for the creation of unidirectional overlaps that is based on the disruption probabilities of start codon and stop codon sets and on the different probabilities of phase 1 and phase 2 overlaps. Additionally, our results suggest that phase 2 overlaps are formed at higher rates than phase 1 overlaps, given the same evolutionary time. Finally, we propose that there is no need to invoke selection to explain the prevalence of long phase 1 unidirectional overlaps. Rather, the overrepresentation of long phase 1 relative to long phase 2 overlaps might occur because it is highly probable that phase 2 overlaps are retained as short overlaps by chance. Such a pattern is stronger if selection against very long overlaps is included in the model. Our model as a whole is able to explain to a large extent the empirical length distribution of unidirectional overlaps in prokaryotic genomes.
منابع مشابه
Varicella Zoster Virus (VZV) Origin-Dependent Plasmid Replication in the Presence of the Four Overlapping Cosmids Comprising the Complete Genome of VZV
The Varicella-Zoster Virus (VZV) genome contains both cis-acting and trans-acting elements, which are important in viral DNA replication. The cis-acting elements consist of two copies of oriS, and the trans-acting elements are those genes whose products are required for virus DNA replication. It has been shown that each of the seven genes required for ori-dependent DNA synthesis of Herpes Simpl...
متن کاملA Novel Prokaryotic Promoter Identified in the Genome of Some Monopartite Begomoviruses
Geminiviruses are known to exhibit both prokaryotic and eukaryotic features in their genomes, with the ability to express their genes and even replicate in bacterial cells. We have demonstrated previously the existence of unit-length single-stranded circular DNAs of Ageratum yellow vein virus (AYVV, a species in the genus Begomovirus, family Geminiviridae) in Escherichia coli cells, which promp...
متن کاملA unified model explaining the offsets of overlapping and near-overlapping prokaryotic genes.
Overlapping genes are a common phenomenon. Among sequenced prokaryotes, more than 29% of all annotated genes overlap at least 1 of their 2 flanking genes. We present a unified model for the creation and repair of overlaps among adjacent genes where the 3' ends either overlap or nearly overlap. Our model, derived from a comprehensive analysis of complete prokaryotic genomes in GenBank, explains ...
متن کاملPolymerization of non-complementary RNA: Systematic symmetric nucleotide exchanges mainly involving uracil produce mitochondrial RNA transcripts coding for cryptic overlapping genes
Usual DNA→RNA transcription exchanges T→U. Assuming different systematic symmetric nucleotide exchanges during translation, some GenBank RNAs match exactly human mitochondrial sequences (exchange rules listed in decreasing transcript frequencies): C↔U, A↔U, A↔U+C↔G (two nucleotide pairs exchanged), G↔U, A↔G, C↔G, none for A↔C, A↔G+C↔U, and A↔C+G↔U. Most unusual transcripts involve exchanging ur...
متن کاملOGtree: a tool for creating genome trees of prokaryotes based on overlapping genes
OGtree is a web-based tool for constructing genome trees of prokaryotic species based on a measure of combining overlapping-gene content and overlapping-gene order in their whole genomes. The overlapping genes (OGs) are defined as adjacent genes whose coding sequences overlap partially or entirely. In fact, OGs are ubiquitous in microbial genomes and more conserved between species than non-OGs....
متن کامل